Abstract

We demonstrate that for the $k$-variable theory $T$ of a finite structure (satisfying certain amalgamation conditions), if finite models of $T$ can be recovered from diagrams of finite subsets of models of $T$ in a certain "efficient" way, then $T$ is rosy - in fact, a certain natural $\aleph_0$-categorical completion $T^{\lim}$ of $T$ is super-rosy of finite $U^{\text{þ}}$-rank. In an appendix, we also show that any $k$-variable theory $T$ of a finite structure for which the Strong $L^k$-Canonization Problem is efficiently solvable has the necessary amalgamation properties up to taking an appropriate reduct.

Introduction

This article is the second of a three-part series (with [10] and [11]) examining the model-theoretic geometry of an algorithmic problem - the $L^k$-Canonization Problem. Here, $L^k$ denotes the fragment of first-order logic consisting of formulas with at most $k$ distinct variables, free or bound, and it can be shown that for any finite structure $\mathcal{M}$ (in a finite relational signature $\rho$), its complete $k$-variable theory $Th^k(\mathcal{M})$ is finitely axiomatizable in a uniform way. The $L^k$-Canonization Problem asks us to devise an operator $F$ that takes the theories $Th^k(\mathcal{M})$ to finite models $F(Th^k(\mathcal{M})) \models Th^k(\mathcal{M})$ - thus defining a "canonical" model of each complete $k$-variable theory that does have finite models. Composing the canonization operator $F$ with the mapping $\mathcal{M} \mapsto Th^k(\mathcal{M})$, the operator $F(Th^k(-))$ can be thought of as a solution to a natural relaxation of the Graph Isomorphism Problem, the status of which is a major open problem in complexity theory (see [18] for an old survey).
Although this problem is known to be unsolvable over the class of all $L^k$-theories (in particular, over all $L^k$-theories of finite structures - see [8]), it has been shown that for the class of stable $L^k$-theories and for the class of super-simple $L^k$-theories with trivial forking dependence (with additional amalgamation assumptions), the $L^k$-Canonization Problem is recursively solvable (see [5] and [6], respectively). In both of those cases, resolution of the $L^k$-Canonization Problem is reduced to showing that certain complete first-order theories associated with the original $L^k$-theories have the finite sub-model property. Thus, after the heavy lifting done by the model theory, the algorithm itself is extremely simple-minded. Moreover, the analyses in [5] and [6] do not assume a priori that the $L^k$-theories in question have finite models. In contrast, in this series of articles, we examine the $L^k$-Canonization Problem for $L^k$-theories that certainly do have finite models. Moreover, we consider implementation of $L^k$-Canonization operators in a significantly restricted model of computation, leading to a notion we call "efficient constructibility." Finally, we expand the original $L^k$-Canonization Problem to take whole $L^k$-elementary diagrams as input, which allows us to work with individual $L^k$-theories in a non-trivial way. Thus, the goal of this series of articles is to prove the following:

Main Theorem. Let $\mathcal{M}_0$ be a finite structure, and let $K$ be the class of all finite models of $T = Th^k(\mathcal{M}_0)$. Assuming that $K$ has adequate amalgamation properties, let $T^{\lim}$ be the complete first-order theory of the direct limit of $K$. Then the following are equivalent:

1. $T^{\lim}$ is super-rosy of finite $U^{\text{þ}}$-rank.

2. $K$ is rosy.

3. The $K$-Construction Problem is solvable in a certain "relational" model of computation (based on [1] and exposed in [9]):

Given $\mathcal{M}[A]$ for some (implicit) $\mathcal{M} \in K$ and $A \subseteq M$,^1
Return $\mathcal{N} \in K$ such that $A \subseteq N$ and $\mathcal{N}[A] = \mathcal{M}[A]$.

In [10], we established the content of parts 1 and 2 of the Main Theorem, and showed that, in fact, 1 and 2 are equivalent. In this article, we will prove the implication

"Efficient constructibility" $\Rightarrow$ 1, 2.

We save for another day (i.e. [11]) the demonstrations that (1) a solution in the relational model of computation induces efficient constructibility in the sense of this article (immediate from the definitions), and (2) that the $K$-Construction Problem is in fact solvable by a relational Turing machine whenever $K$ is rosy (so, super-rosy of finite $U^{\text{þ}}$-rank).

Theorem 0.1. Let $\mathcal{M}_0$ be a finite structure, and let $K$ be the class of all finite models of $T = Th^k(\mathcal{M}_0)$. Assuming that $K$ has adequate amalgamation properties, let $T^{\lim}$ be the complete first-order theory of the direct limit of $K$. If $K$ is efficiently constructible, then $K$ is rosy (super-rosy of finite $U^{\text{þ}}$-rank).

1 Background and the Main Setting 

The contents of subsections 1.1 and 1.2 are taken verbatim from the companion article, [10]. Naturally, we require the settings to be identical. In the last subsection, we recall the necessary facts about þ-independence and rosiness that were established in [10].

1.1 Finite-variable Logics

Finite-variable fragments of first-order logic, $L^k$, were formulated by many authors independently (e.g. [17], but our main references have been [14] and [13]). The importance of $L^k$ and its infinitary extension $L^k_{\infty,\omega}$ in finite-model theory is difficult to overstate. For our purposes, $L^k$ is satisfying because a "complete" $L^k$-theory - that is, complete for $L^k$-sentences - can have many non-isomorphic finite models, which is surely a prerequisite for bringing classical model-theoretic ideas to bear in finite-model theory.

^1 Here $\mathcal{M}[A]$ denotes the induced substructure of $\mathcal{M}$ on the subset $A$, so $\mathcal{M}[A]$ does not carry any further information about $\mathcal{M}$.
Definition 1.1. Let $\rho$ be a finite relational signature. Assume $k \ge ari(\rho) = \max\{ari(R) : R \in \rho\}$ and $k \ge 2$.

1. Fix a set $X = \{x_1, ..., x_k\}$ of exactly $k$ distinct variables. Then $L^X = L^X_\rho$ is the fragment of the first-order logic $L = L_\rho$ keeping only those formulas all of whose variables, free or bound, come from $X$. If $V = \{x_0, x_1, ..., x_n, ...\}$ is the infinite set of first-order variables understood in the construction of the full first-order logic, then $L^k = \bigcup_{X \in \binom{V}{k}} L^X$, where $\binom{V}{k}$ is the set of $k$-element subsets of $V$.

As usual, we write $\varphi(x_1, ..., x_k)$ to mean that the set of free variables of $\varphi$ is a subset of $\{x_1, ..., x_k\}$, but not necessarily identical to it.

2. For a $\rho$-structure $\mathcal{M}$, the $k$-variable theory of $\mathcal{M}$, denoted $Th^k(\mathcal{M})$, is the set of sentences $\varphi$ of $L^k$ such that $\mathcal{M} \models \varphi$. Note that $Th^k(\mathcal{M})$ is complete with respect to $k$-variable sentences in that either $\varphi \in Th^k(\mathcal{M})$ or $\neg\varphi \in Th^k(\mathcal{M})$ for every $k$-variable sentence $\varphi$.

3. For a $k$-tuple $\bar a \in M^k$, we set $tp^k(\bar a; \mathcal{M}) = \{\varphi(x_1, ..., x_k) \in L^k : \mathcal{M} \models \varphi(\bar a)\}$, and if $T = Th^k(\mathcal{M})$, then $S^k_k(T) = \{tp^k(\bar a; \mathcal{M}) : \bar a \in M^k\}$.

It can be shown - in a number of ways - that for a complete $L^k$-theory $T$, $T$ has a finite model only if $S^k_k(T)$ is finite. All of those methods also show that $S^k_k(T)$ is an invariant of $T = Th^k(\mathcal{M})$ rather than $\mathcal{M}$ itself - that is, if $Th^k(\mathcal{N}) = T$ for some other $\rho$-structure $\mathcal{N}$ (equivalently, if $\mathcal{M} \equiv^k \mathcal{N}$), then $\{tp^k(\bar b; \mathcal{N}) : \bar b \in N^k\} = S^k_k(T)$, too. Finally, it can also be shown that if $\mathcal{M}$ is finite, then $Th^k(\mathcal{M})$ is finitely axiomatizable, and in fact, the mapping $\mathcal{M} \mapsto Th^k(\mathcal{M})$ is computable in Rel-Ptime (see [1]). This latter fact is the basis for our notion of efficient constructibility.

4. Let $\mathcal{M}$ be a $\rho$-structure, and let $B \subseteq M$. Then for $X \in \binom{V}{k}$ as above and $e : X \to B \cup X$, we define $L^X(e)$ to be the set

$\{\varphi(e(x_1), ..., e(x_k)) : \varphi(x_1, ..., x_k) \in L^X\}$.

Then,

$L^k(B) = \bigcup_{X} \bigcup_{e : X \to B \cup X} L^X(e)$,

where, again, $X$ ranges over $\binom{V}{k}$. In particular, $L^k(B)$ is not obtained by adding $B$ as a collection of constant symbols to the underlying signature. Finally, we define

$diag^k(B; \mathcal{M}) = \{\varphi \in L^k(B) : \mathcal{M} \models \varphi,\ \varphi \text{ has no free variables}\}$.

We note that the object $diag^k(B; \mathcal{M})$ does not carry the whole of $\mathcal{M}$ with it; this observation is crucial in understanding why the Strong $L^k$-Canonization Problem (below) is non-trivial.

Having defined the $k$-variable logic, we define the $L^k$-Canonization Problem by specifying what would amount to its solution. Thus, an $L^k$-Canonization operator (in the signature $\rho$) is a polynomial-time computable mapping $F : fin[\rho] \to fin[\rho]$ such that,

1. $F(\mathcal{M}) \equiv^k \mathcal{M}$ for all $\mathcal{M} \in fin[\rho]$.

2. If $\mathcal{M} \equiv^k \mathcal{N}$, then $F(\mathcal{M}) = F(\mathcal{N})$.

(Here, $fin[\rho]$ denotes the class of all finite $\rho$-structures.) The map $F$ selects a representative of each $\equiv^k$-class, a "canonical" finite model of each complete $k$-variable theory that does actually have finite models.^2 For $k \ge 3$, this is entirely impossible [8], so the natural move is to ask for canonization operators $F : K \to fin[\rho]$ for sub-classes $K \subseteq fin[\rho]$. As stated, the operator $F$ really acts on theories, not structures, and this seems to be a serious impediment to studying the question from the point of view of "geometric" model theory; we therefore move again to a related problem which yields a better "grip" on the model theory of each $L^k$-theory. Here, we call this the Strong $L^k$-Canonization Problem:

Given $diag^k(A; \mathcal{M})$ for some (implicit) $\mathcal{M} \in fin[\rho]$ and $A \subseteq M$,

Return a model $\mathcal{M}' \models Th^k(\mathcal{M})$ such that $A \subseteq M'$ and $diag^k(A; \mathcal{M}') = diag^k(A; \mathcal{M})$.

(As usual, $diag^k(A; \mathcal{M})$, the $L^k$-elementary diagram of $A$ in $\mathcal{M}$, actually contains $Th^k(\mathcal{M})$, as the sentences of $Th^k(\mathcal{M})$ are precisely the 0-ary formulas satisfied in $\mathcal{M}$ by 0-tuples from $A$.) Of course, Strong $L^k$-Canonization initially appears to be a significantly harder problem than the original $L^k$-Canonization Problem, but we note that previous positive results on the $L^k$-Canonization Problem for $k = 2$ and over restricted classes of $L^k$-theories have always yielded solutions of the Strong $L^k$-Canonization Problem with no additional toil - see [5] and [15]. We derive another advantage from the move to Strong $L^k$-Canonization in that it can be sensibly considered around a single fixed $L^k$-theory without trivializing the problem.^3

1.2 Main Setting 

The context for the Main Theorem (for this and the two companion articles) is the following. Let $\mathfrak{M}_0$ be a countable $\aleph_0$-categorical structure in a finite relational signature $\rho$. We assume that $Th(\mathfrak{M}_0)$ has the finite sub-model property in the following strong sense:

For every sentence $\varphi$ and every finite $A_0 \subset \mathfrak{M}_0$, $\varphi \in Th(\mathfrak{M}_0)$ if and only if there is an algebraically closed finite $A \subset \mathfrak{M}_0$ such that $A_0 \subseteq A$ and $A \models \varphi$.

Under this assumption, the $k$-variable theory $T = Th^k(\mathfrak{M}_0)$ has finite models of all sizes. We will also make two additional universality assumptions about $\mathfrak{M}_0$ with respect to finite models of $T$.

U1. For $B \subset \mathfrak{M}_0$, define

$cl^k_1(B) = B \cup \{a \in \mathfrak{M}_0 : diag^k(B; \mathfrak{M}_0) \models \text{"}tp^k(a/B) \text{ is algebraic"}\}$,

$cl^k_0(B) = B$, $cl^k_{n+1}(B) = cl^k_1(cl^k_n(B))$, $cl^k(B) = \bigcup_n cl^k_n(B)$.

We require that for all $B \subset_{fin} \mathfrak{M}_0$, $acl(B) = cl^k(B)$ and, if $B = acl(B)$, then $tp(B) = tp^k(B) \cup Th(\mathfrak{M}_0)$.

^2 In particular, $F$ is supposed to be invariant with respect to re-encodings of a given structure. In the model of computation that we will work with in these articles, which works directly with unencoded structures, this issue will actually disappear entirely.

^3 There is an asymmetry in fixating on a single $L^k$-theory $T$. If we were initially studying classes $K \subseteq fin[\rho]$ over which (Strong) $L^k$-Canonization has a solution, there is an implicit expectation of uniformity in that we would have expected a single algorithm to serve as an $L^k$-Canonization operator over the entire class. By fixing a single theory $T$, we allow ourselves to exploit additional properties of it that might be hard or impossible to read off from its finite presentation.
U2. $T \models Th_\forall(\mathfrak{M}_0)$.

This assumption has two invaluable consequences:

(a) For every sufficiently large finite model $\mathcal{A} \models T$, there is an $L^k$-elementary embedding $\mathcal{A} \to \mathfrak{M}_0$.

(b) For large enough finite $\mathcal{B} \models T$, for any $A \subseteq B$ such that $A = cl^k(A)$, and any $f_0 : A \to \mathfrak{M}_0$ - a partial $L^k$-elementary map - there is an $L^k$-elementary embedding $f : \mathcal{B} \to \mathfrak{M}_0$ extending $f_0$.

From all of these assumptions and appealing to [3] (Lemma 19 of that article, combined with the assumption on algebraic closures above), we may associate with $T$ a direct limit - a countable structure $\mathfrak{M}$ with the following properties:

C1. $\mathfrak{M}$ also satisfies the universality conditions expressed above.

C2. For any finite tuple $\bar a$ from $\mathfrak{M}$, $tp(\bar a) = tp^k(\bar a) \cup Th(\mathfrak{M})$.

C3. Let $K_0$ be a set of representatives of every isomorphism-type of finite $L^k$-elementary sub-models of $\mathfrak{M}_0$. Then for any non-principal ultrafilter $\mathcal{U}$ on $K_0$, $\mathfrak{M} \equiv \prod K_0/\mathcal{U}$.

Abusing the terminology of [3] slightly, we call the theory $T^{\lim} = Th(\mathfrak{M})$ the canonical completion of $T$.

Definition 1.2. Let $K$ denote the class of (up to isomorphism) finite $L^k$-elementary sub-models of $\mathfrak{M}$ quasi-ordered by the $L^k$-elementary substructure relation. That is to say, starting with the set of finite $L^k$-elementary sub-models of $\mathfrak{M}_0$, we close under isomorphisms and obtain an object $(K, \preceq^k)$ that is much like an abstract elementary class (AEC). By definition, the model theory of the $k$-variable theory $T$ is, effectively, just the model theory of the class $(K, \preceq^k)$.^4 For $\mathcal{M} \in K$ and $C \subseteq M$, we define,

$K_{(C;\mathcal{M})} = \{\mathcal{N} \in K : C \subseteq N,\ diag^k(C; \mathcal{N}) = diag^k(C; \mathcal{M})\}$.

Writing $K_{(C;\mathcal{M})}$ over and over again is rather cumbersome, so we often write $K_C$ - taking $\mathcal{M}$ to be clear from context or assuming that a $k$-variable diagram for $C$ is fixed. Also, we will often identify the pair $(C; \mathcal{M})$ both with the set $C$ and the diagram $diag^k(C; \mathcal{M})$.

Observation 1.3. There is a finite set $\mathcal{E}$ of 0-definable equivalence relations such that for any other 0-definable equivalence relation $E(\bar y_1, \bar y_2)$, there is a boolean combination $\varphi(\bar y_1, \bar y_2)$ of members of $\mathcal{E}$ such that $T^{\lim} \models E(\bar y_1, \bar y_2) \leftrightarrow \varphi(\bar y_1, \bar y_2)$. In particular, let $\mathfrak{M}_{\mathcal{E}}$ be the reduct of $\mathfrak{M}^{eq}$ consisting of the home-sort and the finitely many imaginary sorts corresponding to equivalence relations in $\mathcal{E}$; then $Th(\mathfrak{M}_{\mathcal{E}})$ eliminates imaginaries.

Moreover, each equivalence relation in $\mathcal{E}$ is expressible as a boolean combination of formulas of $L^k$. Thus, for $A \preceq^k \mathfrak{M}$ and $\bar a, \bar b \in A$, $A \models E(\bar a, \bar b)$ if and only if $\mathfrak{M} \models E(\bar a, \bar b)$; this is true even when $A$ is a finite substructure. Thus, each finite $L^k$-elementary substructure $A$ of $\mathfrak{M}$ naturally expands to a substructure of $\mathfrak{M}_{\mathcal{E}}$ carrying all of the relevant information from $\mathfrak{M}^{eq}$. In this way, we also obtain an expansion of the class $(K, \preceq^k)$ (see below) to $K_{\mathcal{E}}$, and where the ambiguity is not dangerous, we will identify $K$ and $K_{\mathcal{E}}$.

^4 In fact, if $T$ is any $k$-variable theory whose finite models have the Joint-Embedding and Amalgamation Properties indicated by the two consequences of U2, it is still possible to generate a limit model $\mathfrak{M}$ as above; given $\mathfrak{M}$, one can then go through this entire process with $\mathfrak{M}_0 = \mathfrak{M}$. Thus, we can also view the model theory of $Th(\mathfrak{M})$ as a by-product of the model theory of $T$ or of $(K, \preceq^k)$. This is the approach taken in 0.

In model theory, it is a common practice to Morley-ize a theory under study - adding relation symbols for every 0-definable relation so that the theory automatically has quantifier-elimination. Indeed, subsequently taking the reduct to the language consisting of only the new relation symbols still does not change anything essential about the theory. Here, we implicitly take a similar approach, working in a language in which the $k$-variable $k$-types are named as relation symbols, as are the projection maps to the finitely many needed imaginary sorts. As a point of interest, while this move is already justified in our Main Setting, it is possible to show that a "capped" $L^k$-theory - in which every diagram of a finite subset of a model extends to a finite model - automatically falls into our Main Setting. This is discussed in Appendix A below.

1.3 Rosiness, model-theoretic independence relations and geometric elimination of imaginaries

To conclude this section, we recall some definitions and facts from [10] about (super-)rosiness of $L^k$-theories and their limit theories consistent with the Main Setting. We recall, in particular, that we essentially defined the phrase "$K$ is rosy," for $K$ the class of finite models of $T$, to mean "$T^{\lim}$ is rosy" (this is in contrast to the developments in [12] and [6]). For the sake of brevity, then, we make several definitions in the context of the limit theory and its models, but we don't mention their translations into the context of the underlying class $K$ itself.

Definition 1.4 (Model-theoretic independence relation). Let $T$, $K$ and $T^{\lim}$ be as in the Main Setting, and let $\mathfrak{M}$ ($= \mathfrak{M}^{eq}$) be a countable model of $T^{\lim}$. An independence relation $⫝$ is a subset of $\{(A, B, C) : A, B, C \subset_{fin} \mathfrak{M}\}$ satisfying the following for all $A, B, C \subset_{fin} \mathfrak{M}$:

Invariance: If $\sigma \in Aut(\mathfrak{M})$, then $A ⫝_C B$ iff $\sigma[A] ⫝_{\sigma[C]} \sigma[B]$.

Preservation of algebraic dependence: If $A ⫝_C B$, then $acl(AC) \cap acl(BC) = acl(C)$.

Existence: $A ⫝_C C$.

Extension: If $A ⫝_C B$ and $BC \subseteq D \subset_{fin} \mathfrak{M}$, then there is an $A' \equiv_{BC} A$ such that $A' ⫝_C D$.

Symmetry: $A ⫝_C B$ if and only if $B ⫝_C A$.

Transitivity: If $C_0 \subseteq C \subseteq B$, then $A ⫝_{C_0} B$ iff $A ⫝_{C_0} C$ and $A ⫝_C B$.

Local character: For every finite family $S = \{S_0, ..., S_{n-1}\}$ of sorts of $\mathfrak{M}^{eq}$, there is a function $g_S : \omega \to \omega$ such that if $A, B \subset_{fin} \bigcup_{i<n} S_i^{\mathfrak{M}}$, then there is a subset $B_0 \subseteq B$ such that $|B_0| \le g_S(|A|)$ and $A ⫝_{B_0} B$.

The relation $⫝$ is a notion of independence if it satisfies Invariance, Preservation of algebraic dependence, Extension, and

Partial right-transitivity: If $C_0 \subseteq C \subseteq B$, then $A ⫝_{C_0} B$ implies $A ⫝_{C_0} C$ and $A ⫝_C B$,

but not necessarily Symmetry, (full) Transitivity or Local character.
In [10], we showed that for $T$, $T^{\lim}$ as in the Main Setting, the restriction to triples of finite sets in the definition of an independence relation is fully consistent with the usual definitions for independence relations. Further, we found that in this setting, there is no actual need to prove Local Character - it follows from the other axioms in this strong ("super") way. In the next sections, we will build, from the assumption of "efficient constructibility," a notion of independence $⫝^d$ which is symmetric and transitive. From this, it will follow that $T$ is rosy. The key theorem from [10] that we need here is the following; it is built upon similar theorems of [7] and [2]:

Theorem 1.5 (Rosy = "has an independence relation"). Let $T$, $T^{\lim}$, and $K$ be as in the Main Setting. The following are equivalent:

1. $K$ is rosy.

2. $K$ admits an independence relation (without reference to Local Character).

3. $T^{\lim}$ is rosy.

4. $T^{\lim}$ admits an independence relation.

5. $T^{\lim}$ admits a notion of independence with Symmetry and Transitivity.

6. $T^{\lim}$ admits a notion of independence with Local character.

(In either of the last two cases, that notion of independence is, then, an independence relation.)

2 Definition of 'Efficient constructibility' 

As noted in the introduction, our remaining goal for this article is to show that "efficiently constructible" classes $K$ (arising as in the Main Setting along with $T$, $T^{\lim}$) are rosy by constructing a notion of independence with Symmetry and Transitivity. For this, of course, we will need to make several definitions leading up to the definition of efficient constructibility.

We should also apologize to the reader for our use of the word "efficient" here. It is well-known to finite-model theorists that the logic FO+IFP - first-order logic extended with the inflationary fixed-point operator - captures relational-Ptime (see [1]); thus, the central role of the IFP-operator in our definition already suggests that the word "efficient" is more-or-less appropriate in that sense.

From now on, we fix $T$, $K$ and $T^{\lim}$ as in the Main Setting with a fixed finite relational signature $\rho$, and $\mathfrak{M}$ always denotes a countable model of $T^{\lim}$. We define

$acl[\mathfrak{M}] = \{acl(A) : A \subset_{fin} \mathfrak{M}\}$

which is a family of finite sets. In this presentation, we will not make any real use of Morley-ization of the language in the sense of Appendix A.

Definition 2.1 (Inflationary fixed-points of expanded formulas). Let $X^{(r)}$ be a new relation symbol, not in $\rho$. Then a formula, say $\psi(\bar x) = \psi(x_0, ..., x_{r-1}; X)$, of $L[\rho \cup \{X\}]$ will be called a proper expanded formula. For $A \in acl[\mathfrak{M}]$, we define a subset $\psi^\infty[A]$ of $A^r$ as follows:

• $\psi^0[A] = \emptyset$

• Given $\psi^t[A] \subseteq A^r$, define

$\psi^{t+1}[A] = \psi^t[A] \cup \{\bar a \in A^r : (A, \psi^t[A]) \models \psi(\bar a)\}$

where $(A, \psi^t[A])$ is the $\rho \cup \{X\}$-expansion of $A$ in which $X$ is interpreted as $\psi^t[A]$.

• $\psi^\infty[A] = \bigcup_{t<\omega} \psi^t[A] = \psi^{|A|^r}[A]$

Note that for every $t < \omega$, the sets $\psi^t[A]$ and $A^r \setminus \psi^t[A]$ are first-order $\emptyset$-definable sets.

We also note that satisfaction, here, is evaluated with respect to the induced substructure $A$, rather than with respect to $\mathfrak{M}$.
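The stage-by-stage iteration in Definition 2.1 can be illustrated on a toy finite structure. The following is a minimal sketch, not part of the article: the function name `inflationary_fixed_point` and the callback `step` (a stand-in for the satisfaction test "$(A, \psi^t[A]) \models \psi(\bar a)$") are hypothetical, and the example formula is the usual reachability formula on a small directed path.

```python
from itertools import product

def inflationary_fixed_point(universe, arity, step):
    """Iterate R_{t+1} = R_t | {a in universe^arity : step(a, R_t)} until stable.

    `step(a, R)` is a hypothetical stand-in for "(A, R) |= psi(a)".
    Returns the fixed point and the number of stages needed to reach it.
    """
    tuples = list(product(sorted(universe), repeat=arity))
    current = frozenset()  # stage 0 is the empty relation
    stages = 0
    while True:
        new = current | {a for a in tuples if step(a, current)}
        if new == current:
            return current, stages
        current, stages = frozenset(new), stages + 1

# Toy instance: reachability along the directed path 0 -> 1 -> 2 -> 3.
edges = {(0, 1), (1, 2), (2, 3)}

def psi(a, R):
    x, y = a
    # psi(x, y; X): E(x, y), or there is z with X(x, z) and E(z, y)
    return (x, y) in edges or any((x, z) in R and (z, y) in edges for z in range(4))

closure, stages = inflationary_fixed_point(range(4), 2, psi)
```

Because each stage only adds tuples, the loop must stop after at most $|A|^r$ stages, which is the bound used in the last clause of the definition.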

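Definition 2.2. A pre-program for $K$ will consist of the data,

$\Pi = (\rho^{test}, \rho^+, \Theta = \{\theta_i\}_{i<N_0}, \Phi = \{\varphi_i\}_{i<N}, \Psi = \{\psi_\sigma, \zeta_\sigma\}_{\sigma : \Phi \to 2}, \bar y = (y_0, ..., y_{r-1}))$

as follows:

• $\rho^{test} = \rho \cup \{X_0^{(r)}, ..., X_{N_0-1}^{(r)}\}$ is an expansion of $\rho$ and for each $i < N_0$, $\theta_i(x_0, ..., x_{r-1}; X_i^{(r)})$ is a proper expanded $\exists$-formula of $L[\rho \cup \{X_i\}]$.

• $\Phi = \{\varphi_0, ..., \varphi_{N-1}\}$ is a family of sentences of $L[\rho^{test}]$.

• $\rho^+ = \rho \cup \{Y^{(r)}\}$ is another expansion of $\rho$; for each map $\sigma : \Phi \to 2$, $\psi_\sigma(x_0, ..., x_{r-1})$ is a proper expanded $\exists$-formula of $L[\rho^+]$; $\zeta_\sigma(\bar y, z_0, ..., z_{m-1})$ is an arbitrary formula of $L[\rho^+]$.

For $A \subset_{fin} \mathfrak{M}$, we write $A^{test}$ for the expansion $(A, \theta_0^\infty[A], ..., \theta_{N_0-1}^\infty[A])$, and we write $A^\sigma$ for the expansion $(A, \psi_\sigma^\infty[A])$.

Suppose $A = acl(A) \subset_{fin} \mathfrak{M}$. Write $\sigma_A$ for the function $\Phi \to 2$ given by

$\sigma_A(\varphi_i) = 1 \iff A^{test} \models \varphi_i$

for each $i < N$. We say that $A$ requests attention at $(a_0, ..., a_{r-1}) \in A^r$ just in case,

$\bar a \in \psi_{\sigma_A}^\infty[A]$ and $A^{\sigma_A} \models \neg\exists \bar y\, \zeta_{\sigma_A}(\bar y, a_0, ..., a_{r-1})$.

If $A$ does not request attention at any $\bar a \in A^r$, then we say that $A$ is $\Pi$-complete. Finally, $\Pi$ is genuinely a pre-program for $K$ if for any $A \in acl[\mathfrak{M}]$,

$A$ is $\Pi$-complete iff $A \models T$ (iff $A \in K$).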

Definition 2.3. Let $\Pi = (\rho^{test}, \rho^+, \Theta, \Phi, \Psi, \bar y)$ be a pre-program for $K$, and let $\mathfrak{M} \models T^{\lim}$. A command operator for $\Pi$ is a map,

$F : acl[\mathfrak{M}] \to \mathcal{P}_{fin}(\mathfrak{M}^{eq})$

satisfying the following whenever $A = acl(A) \subset_{fin} \mathfrak{M}$.

• If $A$ is $\Pi$-complete, then $F(A) = \emptyset$.

• Suppose $A$ is $\Pi$-incomplete. Up to an a priori fixed linear order of quantifier-free $r$-types of $\rho^{test}$ over $\emptyset$, suppose $A$ requests attention at $\bar a \in A^r$, and $\pi = qtp(\bar a; A^{test})$ is minimal for this condition. (In what follows, we will say that $(\Pi, F)$ acts on $A$ at $\bar a$.) Then $F(A)$ is of the form,

$\{(\bar b_{\bar c}, \bar c)/E : \bar c \in Req(\sigma_A, \bar a)\}$

where $E = E_{\sigma_A, \pi}$ is a 0-definable equivalence relation on $r$-tuples and

$Req(\sigma_A, \bar a) = \{\bar c \in A^r : A^{test} \models \pi(\bar c) \text{ and } A^{\sigma_A} \models \neg\exists \bar y\, \zeta_{\sigma_A}(\bar y, \bar c)\}$

and $\bar b_{\bar c} \in \mathfrak{M}^r$ for each $\bar c \in Req(\sigma_A, \bar a)$. Finally, we require that given $\bar a$, the pairs $(\bar c, (\bar b_{\bar c}, \bar c)/E)$ are elements of an $\bar a'$-definable relation (though not necessarily a function) for some $\bar a' \models \pi$ in $A$.

Together, $(\Pi, F)$ is a program for $K$, which then defines an operator

$eval = eval_{\Pi,F} : acl[\mathfrak{M}] \to acl[\mathfrak{M}]$

as follows: For $A \in acl[\mathfrak{M}]$,

$eval(A) = acl(A \cup F(A)) \cap \mathfrak{M}$.

For $A = A_{-1} \subset_{fin} \mathfrak{M}$, $(\Pi, F)$ induces a sequence $eval^*(A) = (A_i)_{i<\omega}$ as follows:

$A_0 = acl(A_{-1})$, $A_{i+1} = eval(A_i)$ for all $i < \omega$.
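The sequence $eval^*(A)$ is an ordinary iteration of a set operator, and the stabilization requirement placed on it in the definition of efficient constructibility can be checked mechanically on toy data. The sketch below is only an illustration under stated assumptions: `eval_star`, `eval_step` (standing in for $A \mapsto acl(A \cup F(A)) \cap \mathfrak{M}$) and the polynomial `p` are hypothetical names, and the toy operator simply adds the least missing element of a five-element universe.

```python
def eval_star(start, eval_step, p):
    """Compute A_0, ..., A_{p(|start|)} and report whether the sequence has stabilized.

    `eval_step` is a hypothetical stand-in for the operator eval of a program;
    `p` is a hypothetical polynomial bound on the number of steps.
    """
    bound = p(len(start))
    seq = [frozenset(start)]
    for _ in range(bound):
        seq.append(frozenset(eval_step(seq[-1])))
    # One extra application checks that A_{bound+1} == A_{bound}.
    stable = frozenset(eval_step(seq[-1])) == seq[-1]
    return seq, stable

UNIVERSE = set(range(5))

def toy_step(current):
    # Toy operator: add the least element of the universe that is still missing.
    missing = sorted(UNIVERSE - set(current))
    return set(current) | {missing[0]} if missing else set(current)

sequence, stable = eval_star({0}, toy_step, lambda t: t + 4)
```

In this toy run the sequence grows by one element per step and is constant from the fourth step on, so the bound $p(1) = 5$ suffices.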

Definition 2.4. We say that $K$ is efficiently constructible if there is a program $(\Pi, F)$ for $K$ such that the following conditions are satisfied:

1. There is a polynomial $p(t) \in \mathbb{N}[t]$ such that for any $A \subset_{fin} \mathfrak{M}$, if $(A_i)_{i<\omega} = eval^*(A)$, then $A_i = A_{p(|A|)}$ whenever $p(|A|) \le i < \omega$.

(We may then define $A_{\Pi,F} := A_{p(|A|)}$, and we note that $A \subseteq A_{\Pi,F} \subset_{fin} \mathfrak{M}$ and $A_{\Pi,F} \in K$.)

2. $(\Pi, F)$ is weakly automorphism invariant: For $A, B \subset_{fin} \mathfrak{M}$, if there is an elementary bijection $f : A \to B$, then $A_{\Pi,F} \cong B_{\Pi,F}$ via an isomorphism extending $f$.

3. For all $A \subseteq B \subset_{fin} \mathfrak{M}$, there is an automorphism $g \in Aut(\mathfrak{M}/A)$ such that $g[A_{\Pi,F}] \subseteq B_{\Pi,F}$.

2.1 Induction graphs and construction graphs

Much of the proof of Theorem 0.1 consists in defining a certain family of directed acyclic graphs (DAGs) - construction graphs - that represent all of the relevant model-theoretic information in runs of a program $(\Pi, F)$. (See Appendix B for background on DAGs and d-separation.) Eventually, we will recover an independence relation for $K$ (hence for $T^{\lim}$) by synthesizing two previously existing ideas. The first of these is that of d-separation in DAGs, which is well-known in statistical learning theory under the rubric of "graphical models." In a fixed finite DAG, d-separation has several properties in common with model-theoretic independence relations (although Extension and Invariance don't make any sense there). The second idea is the definition of þ-independence, $⫝^{\text{þ}}$, found in [2]:

• $A ⫝^M_C B$ iff $acl(AC') \cap acl(BC') = acl(C')$ whenever $C \subseteq C' \subseteq acl(BC)$

• $A ⫝^{\text{þ}}_C B$ iff for every $D \supseteq BC$, there is an $A' \equiv_{BC} A$ such that $A' ⫝^M_C D$.

In effect, we will just replace $⫝^M$ by a perturbation of d-separation in our construction graphs and play the same game as in [2]. A certain easy observation about the definability of our construction graphs (in $\mathfrak{M} = \mathfrak{M}^{eq}$) will guarantee that we can win this game.
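For readers less familiar with d-separation, the relation can be tested mechanically in a finite DAG. The following is a minimal sketch that assumes the standard moralization criterion from the graphical-models literature (restrict to the ancestral set of the three vertex sets, marry co-parents, drop directions, delete the conditioning set, and test connectivity); it is not taken from this article's Appendix B, and the names `ancestral_set` and `d_separated` are hypothetical.

```python
def ancestral_set(dag, nodes):
    """Return (all ancestors of `nodes`, including `nodes`; the parent map).

    `dag` maps each vertex to the set of its children.
    """
    parents = {}
    for u, children in dag.items():
        for v in children:
            parents.setdefault(v, set()).add(u)
    seen, stack = set(nodes), list(nodes)
    while stack:
        v = stack.pop()
        for u in parents.get(v, ()):
            if u not in seen:
                seen.add(u)
                stack.append(u)
    return seen, parents

def d_separated(dag, xs, ys, zs):
    """True iff xs and ys are d-separated by zs (moralization criterion)."""
    xs, ys, zs = set(xs), set(ys), set(zs)
    keep, parents = ancestral_set(dag, xs | ys | zs)
    adj = {v: set() for v in keep}
    for v in keep:
        ps = list(parents.get(v, ()))  # parents of a kept vertex are kept too
        for u in ps:                   # undirected parent-child edges
            adj[u].add(v)
            adj[v].add(u)
        for i, u in enumerate(ps):     # "marry" parents sharing the child v
            for w in ps[i + 1:]:
                adj[u].add(w)
                adj[w].add(u)
    frontier = [x for x in xs if x not in zs]
    reached = set(frontier)
    while frontier:                    # search that never enters zs
        v = frontier.pop()
        for w in adj[v]:
            if w not in zs and w not in reached:
                reached.add(w)
                frontier.append(w)
    return not (reached & ys)

chain = {"a": {"b"}, "b": {"c"}}
collider = {"a": {"c"}, "b": {"c"}, "c": {"d"}}
```

On the chain, conditioning on the middle vertex separates the endpoints; on the collider, the two parents are separated by the empty set but become connected once the common child, or a descendant of it, is conditioned on.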

Finally, we must remark that our construction seems to be only loosely tied to the original $K$-Construction Problem (or Strong $L^k$-Canonization). Accordingly, it may be of some more general interest to investigate a larger world of connections between rosiness in, say, $\aleph_0$-categorical theories and constructibility conditions defined using inflationary fixed-points of formulas.

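Definition 2.5. Let $\varphi(x_0, ..., x_{r-1}; X^{(r)})$ be a formula of $L[\rho \cup \{X\}]$. Let $A \subset_{fin} \mathfrak{M}$, $R \subseteq A^r$ and

$R' = R \cup \{\bar a \in A^r : (A, R) \models \varphi(\bar a)\}$.

For $\bar a, \bar b \in A^r$, we define

"$\bar b \Vdash^\varphi \bar a \in R'$ w.r.t. $(A, R)$"

to mean that:

1. $\bar a \in R' \setminus R$ and $\bar b \in R$;

2. $(A, R) \models \varphi(\bar a)$;

3. $(A, R_{\bar b}) \models \neg\varphi(\bar a)$ where $R_{\bar b} = R \setminus acl(\bar b)^r$.

Assuming $\varphi^\infty[A] = \varphi^e[A]$, define $IG_\varphi[A, e]$ to be the directed graph with vertex set $(e+1) \times A^r$ and edge set $E^\varphi[A, e] = \bigcup_{t<e} E^\varphi_t[A, e]$, where for each $t < e$,

$E^\varphi_t[A, e] = \{((t, \bar a), (t+1, \bar a)) : \bar a \in A^r\} \cup \{((t, \bar b), (t+1, \bar a)) : \bar b \Vdash^\varphi \bar a \in \varphi^{t+1}[A] \text{ w.r.t. } (A, \varphi^t[A])\}$.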

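Definition 2.6. To define the induction graph of $\Pi$ on $A$, denoted $IG_\Pi[A]$, we first set

$e^\Theta(A) = \min\{e : \bigwedge_{i<N_0} \theta_i^\infty[A] = \theta_i^e[A]\}$.

Then $IG_\Theta[A]$ has vertex set $(e^\Theta(A) + 1) \times A^r$ and edge set

$E_\Theta[A] = \bigcup_{i<N_0} E^{\theta_i}[A, e^\Theta(A)]$.

Now, let $\sigma = \sigma_A$ and $e_A = \min\{e : \psi_\sigma^\infty[A] = \psi_\sigma^e[A]\}$. Then $IG_\Pi[A]$, with vertex set $(e^\Theta(A) + e_A + 1) \times A^r$, is obtained by gluing $IG_{\psi_\sigma}[A, e_A]$ to the end of $IG_\Theta[A]$ in the natural way.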

Definition 2.7. Let $A \subset_{fin} \mathfrak{M}$. We first define $CG^0[A] = CG^0_{\Pi,F}[A]$ as follows:

1. Let $eval^*(A) = (A_i)_{i<\omega}$ and set $p = p(|A|)$. (For the statement here, we assume that $A_i$ is $\Pi$-incomplete for each $i < p(|A|)$.)

2. For each $i = 0, 1, ..., p-1$, we define an enlargement $IG_\Pi[A_i]^+$ of the induction graph $IG_\Pi[A_i]$. Assuming $(\Pi, F)$ acts on $A_i$ at $\bar a_i \in A_i^r$ and setting $\pi_i = qtp(\bar a_i; A_i^{test})$, we define

$C_i = \{\bar c \in A_i^r : A_i^{test} \models \pi_i(\bar c) \text{ and } A_i^{\sigma_{A_i}} \models \neg\exists \bar y\, \zeta_{\sigma_{A_i}}(\bar y, \bar c)\}$

and let $\bar c \mapsto (\bar d_{\bar c}, \bar c)/E$ be the map associated with the application of $F$. To make $E_\Pi[A_i, \bar a_i]^+$, we just add a new layer to $E_\Pi[A_i]$: the set of new vertices is $\{\star_i\} \times A_{i+1}^r$, and the new edges are of the form

$((e^\Theta(A_i) + e_{A_i} + 1, \bar c), (\star_i, \bar b))$

for $\bar c \in C_i$ and $\bar b \in A_{i+1}^r \cap acl^{eq}((\bar d_{\bar c}, \bar c)/E)$.

3. We obtain $CG^0[A]$ by gluing/identifying each vertex $(\star_i, \bar a)$ of $E_\Pi[A_i]^+$ with the vertex $(0, \bar a)$ of $E_\Pi[A_{i+1}, \bar a_{i+1}]^+$.

Next, we define the construction graph $CG[A]$ by pruning as follows:

1. $CG[A]$ retains only the following vertices:

• The vertices of the very first layer (those from the first layer of $E_\Pi[A_0]^+$), which we now write as $(\star_{-1}, \bar a)$.

• Those of the form $(\star_i, \bar b)$ for some $i < p$.

2. For $i = -1, 0, ..., p-1$, $((\star_i, \bar a), (\star_{i+1}, \bar b))$ is an edge of $CG[A]$ if one of the following holds:

• $rng(\bar a) = rng(\bar b)$;

• There is a (directed) path from $(\star_i, \bar a)$ to $(\star_{i+1}, \bar b)$ in $E_\Pi[A_i, \bar a_i]^+$, and $\bar a \in acl(\bar b)$.

We will need to do a bit more bookkeeping in our construction graphs than we can easily express without some further machinery. Thus, we define a certain "tuple-labeling function" that connects everything produced in a run of the program $(\Pi, F)$ on $A \subset_{fin} \mathfrak{M}$ back to the original set $A$. From this, we define the notion of "hereditary descendants" in our construction graphs, which will be indispensable in both the definition and the analysis of our independence relation.

Definition 2.8. Tuple-labeling maps $\Lambda^A : V(CG[A]) \to H(A)$ and hereditary descendants.
For a non-empty set $X$, we define $H(X)$ as follows:

• $H_0(X) = (\{0\} \times X)^{<\omega}$

• $H_{n+1}(X) = \{((n+1, a_0), ..., (n+1, a_{k-1})) : a_0, ..., a_{k-1} \in H_n(X),\ 0 < k < \omega\}$

• $H(X) = \bigcup_{n<\omega} H_n(X)$.

We define $\Lambda^A : V(CG[A]) \to H(A)$ as follows.

• Let $\Lambda^A_0$ be just $(\star_{-1}, \bar a) \mapsto ((0, a_0), ..., (0, a_{r-1}))$

• Given $\Lambda^A_i$: for a vertex $(\star_{i+1}, \bar b)$, let $(\star_i, \bar a_0), ..., (\star_i, \bar a_{n-1})$ enumerate $(\star_{i+1}, \bar b)$'s predecessors in $CG[A]$. Then set

$\Lambda^A_{i+1}(\star_{i+1}, \bar b) = ((i+1, \Lambda^A_i(\star_i, \bar a_0)), ..., (i+1, \Lambda^A_i(\star_i, \bar a_{n-1})))$.

• Set $\Lambda^A = \bigcup_{i<p} \Lambda^A_i$

Finally, if $A_0 \subseteq A$, we define the set of hereditary descendants of $A_0$ with respect to $A$, $hdesc(A_0; A)$, inductively as follows:

1. $\{\star_{-1}\} \times A_0^r \subseteq hdesc(A_0; A)$

2. If $\Lambda^A(\star_{i+1}, \bar b) = ((i+1, \Lambda^A_i(\star_i, \bar a_0)), ..., (i+1, \Lambda^A_i(\star_i, \bar a_{n-1})))$ and $(\star_i, \bar a_j) \in hdesc(A_0; A)$ for some $j < n$, then $(\star_{i+1}, \bar b) \in hdesc(A_0; A)$
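The two recursions in Definition 2.8 are easy to run on a small layered graph. The sketch below is a toy illustration under stated assumptions, not the article's construction: vertices are encoded as pairs `(layer_tag, tuple)`, `preds` lists each vertex's predecessors in the previous layer, labels are tagged by layer position (0 for the first layer, then 1, 2, ...), which simplifies the article's indexing, and the name `label_and_descend` is hypothetical.

```python
def label_and_descend(layers, preds, seeds):
    """Compute nested tuple labels and hereditary descendants layer by layer.

    layers: list of lists of vertices, first layer first.
    preds:  dict mapping a vertex to the list of its predecessors (previous layer).
    seeds:  set of first-layer vertices playing the role of {star_{-1}} x A_0^r.
    """
    labels, marked = {}, set()
    for pos, layer in enumerate(layers):
        for v in layer:
            if pos == 0:
                # First layer: tag each coordinate of the tuple with 0.
                labels[v] = tuple((0, x) for x in v[1])
                if v in seeds:
                    marked.add(v)
            else:
                ps = preds.get(v, [])
                # Later layers: tag the labels of the predecessors with pos.
                labels[v] = tuple((pos, labels[u]) for u in ps)
                # A vertex is a hereditary descendant if some predecessor is.
                if any(u in marked for u in ps):
                    marked.add(v)
    return labels, marked

a0, b0 = (-1, ("a",)), (-1, ("b",))
c1, b1 = (0, ("c",)), (0, ("b",))
d2 = (1, ("d",))
toy_layers = [[a0, b0], [c1, b1], [d2]]
toy_preds = {c1: [a0], b1: [b0], d2: [c1, b1]}
toy_labels, toy_hdesc = label_and_descend(toy_layers, toy_preds, {a0})
```

Here the vertex `b1` is not a hereditary descendant of the seed `a0`, while `d2` is, because one of its two predecessors is.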

The following observation is very important, even though it follows very easily from noticing that (i) finitely many iterations of the inflationary fixed-point operation amount to a first-order formula, and (ii) given $A \subset_{fin} \mathfrak{M}$, each of the hypothetical scenarios $A \subseteq B \subset_{fin} \mathfrak{M}$ can be encoded as a first-order statement.

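Observation 2.9 (Definability of construction graphs). For each $k < \omega$, there is a finite family $X^k_0, ..., X^k_{n_k-1}$ of sorts of $\mathfrak{M}^{eq}$ such that the following holds: Let $A \subseteq B \subset_{fin} \mathfrak{M}$. Then for any $g \in Aut(\mathfrak{M}/A)$ such that $g[A_{\Pi,F}] \subseteq B_{\Pi,F}$, there is a mapping,

$f^g : hdesc(A; B) \to dcl^{eq}(g[A_{\Pi,F}])$

satisfying the following conditions:

1. If $k \ne i$, then $\{X^k_j\}_{j<n_k} \cap \{X^i_j\}_{j<n_i} = \emptyset$.

2. For each $k < p(|B|)$ and $(\star_k, \bar a) \in (\{\star_k\} \times B_k^r) \cap hdesc(A; B)$, $f^g(\star_k, \bar a) \in X^k_0 \cup \cdots \cup X^k_{n_k-1}$.

3. For each $k < p(|B|)$, and $(\star_k, \bar a_1), (\star_k, \bar a_2) \in (\{\star_k\} \times B_k^r) \cap hdesc(A; B)$,

$tp(f^g(\star_k, \bar a_1)/A) = tp(f^g(\star_k, \bar a_2)/A)$ in $\mathfrak{M} = \mathfrak{M}^{eq}$

implies

$tp_\exists(\bar a_1; B_k^{test}) = tp_\exists(\bar a_2; B_k^{test})$

where $tp_\exists(-; B_k^{test})$ indicates a partial type consisting of existential formulas with satisfaction evaluated in the finite structure $B_k^{test}$.

4. For any $k < p(|B|)$ and $(\star_k, \bar a) \in (\{\star_k\} \times B_k^r) \cap hdesc(A; B)$, $acl(\bar a) \subseteq acl(A \cup \{f^g(\star_k, \bar a)\})$.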

We derive the following lemma as an immediate consequence of Observation 2.9 and the fact that $T$ is $\aleph_0$-categorical, hence uniformly locally finite. We refer the reader to Appendix B for the notion of d-separation and the notation $[- \perp\!\!\!\perp - \mid -]$.

Lemma 2.10. There is a function $loc : \omega \times \omega \to \omega$ such that for all $A, B \subseteq C \subset_{fin} \mathfrak{M}$ and any $\bar c_0, ..., \bar c_n \in C^r$ with $n > loc(|A|, |B|)$, if

$[(\star, \bar c_i) \perp\!\!\!\perp (\star, \bar c_j) \mid hdesc(AB; C)]_{CG[C]}$

for all $i < j < n$ (setting ★ = forevermore), then

$[(\star, \bar c_i) \perp\!\!\!\perp \{\star\} \times A^r \mid hdesc(B; C)]_{CG[C]}$

for some $i < n$.



3 Deviation and d-Independence

In this section, we use the construction graphs developed above to define an independence relation $⫝^d$ on our efficiently constructible class $K$. To do this, we first make a perturbation $\downarrow$ - called local separation, for lack of inspiration - of the d-separation relation; thereafter, we use local separation as a substitute for the $⫝^M$ relation of [2], and we proceed along similar lines.

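Definition 3.1. Let $A, B, C \subseteq D = acl(D) \subset_{fin} \mathfrak{M}$. We say that $A, B$ are locally separated over $C$ in $D$ - denoted $A \downarrow_C B / D$ - if

$[\{\star\} \times A^r \perp\!\!\!\perp \{\star\} \times B^r \mid hdesc(acl(C'); D)]_{CG[D]}$

whenever $C \subseteq C' \subseteq acl(BC)$.

Again with $B, C \subseteq D \subset_{fin} \mathfrak{M}$, though $D$ need not be algebraically closed, let $\pi(\bar x)$ be a partial type over $BC$. We define $\Delta_0[\pi(\bar x), C]_D$ to be the set of finite sets $D \subseteq D' \in acl[\mathfrak{M}]$ such that:

1. $\pi(\bar x)$ is realized in $D'$;

2. For every realization $\bar a$ of $\pi(\bar x)$ in $D'$,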

aiBcD/D' alcD/D' 
(using $\bar a$ as shorthand for the set $rng(\bar a)$).

We define $\Delta[\pi, C]_D$ to be the set of isomorphism types over $D$ appearing in $\Delta_0[\pi, C]_D$, and we call $\Delta[\pi, C]_D$ the deviation of $\pi$ over $C$ with respect to $D$.

Finally, we define $A ⫝^d_C B$ to mean that $\Delta[p, C]_D$ is finite whenever $ABC \subseteq D \subset_{fin} \mathfrak{M}$, $p = tp(\bar a/BC)$ and $\bar a$ is an enumeration of $A$. We also say, then, that $p$ does not deviate over $C$.



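Lemma 3.2. Let $A, B, C \subseteq D \subseteq_{\mathrm{fin}} M$ where $D = \mathrm{acl}(D)$. 

$\downarrow$-Monotonicity: If $A \downarrow_C B/D$ and $B_0 \subseteq B$, then $A \downarrow_C B_0/D$. 

$\downarrow$-Base-monotonicity: If $A \downarrow_C B/D$ and $B_0 \subseteq B$, then $A \downarrow_{CB_0} B/D$. 

Preservation of algebraic dependence: If $A \downarrow_C B/D$, then $\mathrm{acl}(AC) \cap \mathrm{acl}(BC) = \mathrm{acl}(C)$. 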

Proof of $\downarrow$-Monotonicity. Assuming $A \downarrow_C B/D$, we consider $C \subseteq X \subseteq \mathrm{acl}(B_0C)$. Clearly, $C \subseteq X \subseteq 
\mathrm{acl}(BC)$, so 

[{★}xA'' n {★}xS'~ I hdesc{s.cl{X); D)]cG[D] ■ 
By the monotonicity of $[- \perp\!\!\!\perp - \mid -]$ (see the appendix), we find 

[{*}xA'' U WxSj I hdesc(acl(X);L')]^g,p] 

and this suffices. □ 

Proof of $\downarrow$-Base-monotonicity. Assuming $A \downarrow_C B/D$, we now consider $B_0C \subseteq X \subseteq \mathrm{acl}(BC)$. 
Again, $C \subseteq X \subseteq \mathrm{acl}(BC)$, so 

[{★}xA'' n HxS'" I \idesc[a.c\{X);D)](jG[D\ 
follows from the assumption. □ 
Proof of preservation of algebraic dependence. For the contrapositive, suppose $e \in \mathrm{acl}(AC) \cap \mathrm{acl}(BC)$ 
and $e \notin \mathrm{acl}(C)$. Then the tuple $(\underbrace{e, \dots, e}_{n\ \text{times}})$ witnesses 

[{★}xA^ ^{*}xS^ I hdesc(acl(X);Z?)](^(.p] 
so $A \not\downarrow_C B/D$. □ 

In preparation for some slightly more complex arguments, we collect a few useful observations. 

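Observation 3.3. Let $B, C \subseteq D \subseteq D_1 \subseteq E \in \mathrm{acl}[M]$, and let $\pi_0(x) \subseteq \pi(x)$ be partial types over 
$BC$. 

1. If $E \in \Delta[\pi, C]_D$, then $E \in \Delta[\pi, C]_{D_1}$. 

2. If $E \in \Delta[\pi_0, C]_D$, then $E \in \Delta[\pi, C]_D$. 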

Observation 3.4. Let $\pi(\bar{x})$ be a partial type over $C \subseteq_{\mathrm{fin}} M$. If every complete extension of $\pi$ to $C$ 
is algebraic, then $\pi$ is algebraic. 

The following proposition collects what we know of $\downarrow^{d}$ so far; that is, $\downarrow^{d}$ is a notion 
of independence, if not an independence relation, even if the underlying program $(\Pi, F)$ does not 
actually witness efficient constructibility. 

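Proposition 3.5. $\downarrow^{d}$ has the following properties: 

Invariance: If $A, B, C \subseteq_{\mathrm{fin}} M$ and $\sigma \in \mathrm{Aut}(M)$, then $A \downarrow^{d}_C B \iff \sigma[A] \downarrow^{d}_{\sigma[C]} \sigma[B]$. 

Preservation of algebraic dependence: For $A, B, C \subseteq_{\mathrm{fin}} M$, if $A \downarrow^{d}_C B$, then 

$\mathrm{acl}(AC) \cap \mathrm{acl}(BC) = \mathrm{acl}(C)$. 

Existence: If $A, C \subseteq_{\mathrm{fin}} M$, then $A \downarrow^{d}_C C$. 

Extension: For $A, B, B_1, C \subseteq_{\mathrm{fin}} M$ with $BC \subseteq B_1$, if $A \downarrow^{d}_C B$, then there is an $A' \equiv_{BC} A$ 
such that $A' \downarrow^{d}_C B_1$. 

Partial right-transitivity: For $A, B_0, B, C \subseteq_{\mathrm{fin}} M$ with $B_0 \subseteq B$, if $A \downarrow^{d}_C B$, then $A \downarrow^{d}_C B_0$ and 
$A \downarrow^{d}_{CB_0} B$. 

Partial left-transitivity: For $A, A_0, B, C \subseteq_{\mathrm{fin}} M$ with $A_0 \subseteq A$, if $A_0 \downarrow^{d}_C B$ and $A \downarrow^{d}_{CA_0} B$, then 
$A \downarrow^{d}_C B$. 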

Both Invariance and Existence are self-evident from the definitions, and Preservation of algebraic 
dependence for $\downarrow^{d}$ is an easy consequence of Preservation of algebraic dependence for $\downarrow$. Partial 
left-transitivity is somewhat more involved, so we defer its proof to later in this section; the proof 
requires Lemma 3.7 below. 
Proof of Extension. Let $A, B, B_1, C \subseteq_{\mathrm{fin}} M$ with $BC \subseteq B_1$, and suppose $A \downarrow^{d}_C B$. Let $a$ be an 
enumeration of $A$, and let $p = \mathrm{tp}(a/BC)$. By the $\aleph_0$-categoricity of $T = Th(M)$, let $p_0, \dots, p_{n-1} \in 
S(B_1)$ be an enumeration of the complete extensions of $p$ to $B_1$. Towards a contradiction, suppose 
that for each $i < n$, there is some $D_i$ with $B_1 \subseteq D_i \subseteq_{\mathrm{fin}} M$ such that $\Delta[p_i, C]_{D_i}$ is infinite. Without loss of 
generality, we may assume that $\Delta[p_i, C]_D$ is infinite for each $i < n$, where $D = D_0 \cup \dots \cup D_{n-1}$. 
It follows that $\Delta[p, C]_D$ is infinite, which contradicts $A \downarrow^{d}_C B$. Hence, we may certainly choose 
some $p_i$ and a realization $a' \models p_i$ such that $a' \downarrow^{d}_C B_1$, as desired. □ 

Proof of Partial right-transitivity. Let A, B, Bo,C Cgn M with Bq C B, and suppose AXcB. We 
show that ylJ^cBo-^- ^ ^ acl[A^]i:) \ Ao[tp(a/i3C), C]/5, where BC D C E and a enumerates 

A. We obtain an ai G ii^l"' such that ai \= tp{a/BC), ai Ibc D/E and oi ic D/E. BY l-base- 
monotonicity, we have ai icBo D/E., so ai violates the condition, "ai icBo D/E ailcD/E.'" 
We have shown, then, that 

acl[X]D \ Ao[tp(a/BC),C]B C acl[M]B \ Aq [tp (a/SC), CSqId 

and with AJ5c-S, it follows that A^cBqB., as desired. The proof that AJ^c-Sq is similar, so we 
omit that portion. □ 

Lemma 3.6. Let A,B,C Cfin M where < r, and suppose AScB but BXcA. Then 

$B \not\subseteq \mathrm{acl}(AC)$. 

Proof. Towards a contradiction, suppose B C acl(^C). Let a and b be enumerations of A and 

B, respectively. From the hypothesis bJLcA, we may take A[p{a,y),C]iya to be infinite, where 
p = tp{ab/C) and COD Cfin M \ A. From preservation of algebraic dependence, we know that 
bJ^AC^D. Applying Extension and Invariance repeatedly, we implement the following construction: 
Construction: 

Stage 0: Set oq = a 

Stage At the beginning of stage z + 1, we have oq, ■■■,a.i, pairwise distinct, such that for 
each j < i, 



«j =BD oo; 

2js 



— aj\y BD DbaQ...aj^i. 



We then choose Oj+i =b)M^^ ^ such that Cj+i J5_bd -D6ao...aj. 

Since $\Delta[p(\bar{x}, b), C]_E$ is finite for almost every $E$ containing $Cb$ (up to automorphisms over $Cb$), and 
applying Ramsey's theorem and the $\aleph_0$-categoricity of $T$, we may also choose a chain $E_0 \subseteq E_1 \subseteq 
\dots \subseteq E_i \subseteq \dots \subseteq_{\mathrm{fin}} M$ of algebraically closed sets such that 

[(★,ai) U {•k,aj) \ hdesc{acl{Db); Ek](jQ^^^^ 

and 

[(★, Oi) ^ I hdesc(acl(i:)); E^fe)] cG[Ek] 
whenever $i < j < k < \omega$. This contradicts Lemma 2.10, so the lemma is proven. □ 
Lemma 3.7 now follows from Lemma 3.6 and Observation 3.4. 

Lemma 3.7. Let $C \subseteq_{\mathrm{fin}} M$ and $a, b \in M^{<\omega}$, and let $p(x, y) = \mathrm{tp}(ab/C)$. Let $Cb \subseteq D \subseteq_{\mathrm{fin}} M$, and 
suppose $\Delta[p(x, b), C]_D$ is infinite. If $q(x, b)$ is a complete extension of $p(x, b)$ to $D$ which does not 
deviate over $Cb$, then $q(x, b)$ is algebraic. 

Proof of Partial left-transitivity. Assume $a_1 \downarrow^{d}_C B$ and $a_2 \downarrow^{d}_{Ca_1} B$, and towards a contradiction, suppose 
$a_1a_2 \not\downarrow^{d}_C B$, where $a_1, a_2 \in M^{<\omega}$ and $BC \subseteq_{\mathrm{fin}} M$. By a straightforward argument using 
Extension, we may work under the assumption that $\Delta[\mathrm{tp}(a_1a_2/BC), C]_{BC}$ is infinite. By Existence 
and the preceding lemma, we know that $\mathrm{tp}(a_1a_2/BC)$ must be algebraic, i.e. $a_1a_2 \in \mathrm{acl}(BC)$. For 
the contradiction, then, we will derive that $\mathrm{tp}(a_1a_2/BC)$ is non-algebraic. 

For a (local) contradiction, suppose $\mathrm{tp}(a_1a_2/BC)$ is algebraic. From preservation of algebraic 
dependence, one can show that $a_1a_2 \not\downarrow^{d}_C B$ implies $a_1a_2 \notin \mathrm{acl}(C)$, and it follows that $\mathrm{tp}(a_1/C)$ and 
$\mathrm{tp}(a_2/Ca_1)$ are not both algebraic. Similarly, as $\mathrm{tp}(a_1a_2/BC)$ is algebraic, it cannot be that both 
$\mathrm{tp}(a_1/BC)$ and $\mathrm{tp}(a_2/BCa_1)$ are non-algebraic. If $\mathrm{tp}(a_1/BC)$ is algebraic, then we have $a_1 \not\downarrow^{d}_C B$, 
and if $\mathrm{tp}(a_2/BCa_1)$ is algebraic, then $a_2 \not\downarrow^{d}_{Ca_1} B$; in either case, a contradiction. Thus, $\mathrm{tp}(a_1a_2/BC)$ 
must be non-algebraic (the higher-level contradiction), which completes the proof. □ 

Theorem 3.8. $\downarrow^{d}$ is symmetric and fully transitive. With Proposition 3.5, $\downarrow^{d}$ is an independence 
relation for K, so K is rosy. 

Proof. By partial right- and left-transitivity, we need only demonstrate symmetry. Let $C \subseteq_{\mathrm{fin}} M$ 
and $a, b \in M^{<\omega}$, and assume $a \downarrow^{d}_C b$. Towards a contradiction, we suppose that $b \not\downarrow^{d}_C a$; that 
is, let $C \subseteq D \subseteq_{\mathrm{fin}} M \setminus \mathrm{rng}(a)$ such that $\Delta[\mathrm{tp}(b/Ca), C]_{Da}$ is infinite. By Lemma 3.6, we know 
that $p(a, y) = \mathrm{tp}(b/Ca)$ is non-algebraic, so $p(a, y)$ has a non-algebraic extension $q(a, y)$ to $Da$, 
which by Lemma 3.7 must deviate over $Ca$. Thus, we can choose $D \subseteq D' \subseteq_{\mathrm{fin}} M \setminus \mathrm{rng}(a)$ such 
that $\Delta[q(a, y), Ca]_{D'a}$ is infinite. Using the construction from the proof of Lemma 3.6, we obtain 
a sequence $(b_i, E_i)_{i<\omega}$ such that $(b_i)_{i<\omega}$ is $\mathrm{acl}(D'a)$-indiscernible, $b_0 \equiv_{D'a} b$ and for every $i < \omega$, 
D'aD'abo...bi^i and 

bi iDa D'a/Ej ^bil caD'a/Ej 

whenever i < j < uo. By Ramsey's theorem, again, we may then assume that either bi / jjaD'a/Ej 
for all i < j < UJ, oi bil caD'a/ Ej for all i < j < to. In either case, we derive a contradiction to 
lemma Eini □ 
A The complete invariant for $L^k$ and game tableaux 

In this subsection, we introduce both the complete invariant for k-variable theories of finite $\rho$- 
structures and the notion of a game tableau for the $L^k$-theory of a fixed finite $\rho$-structure. The 
latter is key in much of our analysis of computational problems around finite-variable logic. The 
invariant is not itself terribly useful in this work, but as well as providing the starting point for game 
tableaux, this is the finite encoding of $L^k$-theories mentioned in the introduction. The material on 
the complete invariant can be found in |13| or [T3], and the material on game-tableaux can be found 
(with non-trivial differences) in |14| . where it is deployed only in relation to the 2-variable logic. 

Suppose $M$ is a finite $\rho$-structure. Then, the quotient structure $M^k/{\equiv^k}$ is essentially synonymous 
with the set $S^k(T)$ of k-variable k-types of $T$, where $T = Th^k(M)$ is the complete k-variable 
theory of $M$. Moreover, if the set of quantifier-free k-types of $\rho$ is endowed (a priori but arbitrarily) 
with a linear order, then there is an $\equiv^k$-invariant algorithm computing a mapping 

fin[£>] fin[{<}] : M ^ (iVV=^ <^) 

where [N^ /=^, <-^) is a linear orderU this algorithm has running-time lA^I*^'-'^-* (see |14| or |13)). 
Now, for each complete quantifier-free /c-type 6(xi, ) of Q, let yj^^ be a new unary predicate 

symbol; for each permutation a E Sym [k], let P^^^ be a new binary predicate symbol; and let Acc^'^^ 
be an additional unary predicate symbol. Let be the signature consisting of these symbols 
together with the binary relation symbol <. Given a finite ^?-structure M, then, we define I^{A4) 
to be a ^)™^-structure with universe / =^ as follows: 

• <^^''(-^) is the linear order, <-^, of described above. 

• y/ = {a/=^ : a £ , M ^ Oia)] for each quantifier-free A;-type 6. 

• For a € Sym[k], we put {a/= ,b/= ) € Po- just in case, 

(7W,(a<,(i),...,a^(fc))) {M,b). 

• For a,b ^ we put (a/=^, h/=^) £ Acc^ ^'^^ just in case there is an element m G M, such 
that 

{M,{m,a2,...,ak)) =^ {M,b). 

Equivalently, (a/='^, 6/='^) G Acc^''^^^ if for every (a^,...,a'^) E a/='', there is an m' G M 
such that 

(A^,(m',a'2,...,4)) ^'^ {M,b). 
This operator is known in the literature as the complete invariant for k-variable logic. 

Theorem A.l. (ml) Let M and M he finite g-structures. Then M =^ N if and only if I^[M) = 
I^{M). Moreover, the mapping A4 i— )• l'^{A4) is computable by a relational Turing machine with 
running-time \M\'^^^\ 

¹ $\equiv^k$-Invariance, here, means that if $M \equiv^k N$, then (i) $(M^k/{\equiv^k}, <^M)$ and $(N^k/{\equiv^k}, <^N)$ have the same (finite) 
length, and (ii) if $a \in M^k$ and $b \in N^k$ are such that $\mathrm{tp}^k(a; M) = \mathrm{tp}^k(b; N)$, then $a/{\equiv^k}$ and $b/{\equiv^k}$ have the same 
position in the orderings. 
We note that since $I^k(M)$ is linearly ordered, there is a canonically isomorphic structure in the same signature 
with universe $[n] := \{1, \dots, n\}$, where $n = |M^k/{\equiv^k}| < \omega$, in which the linear order is the standard 
one, and this transformation is computable in polynomial-time. Thus, it is not terribly abusive to 
write $I^k(M) = I^k(N)$ instead of $I^k(M) \cong I^k(N)$, with the understanding that we pass to this 
canonical/standard model. 
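
The passage to the standard model can be sketched in a few lines. This is only an illustration under assumptions that are not in the text: the ordered structure is handed over explicitly as a universe, a strict linear order and a dictionary of relations, and the names `standardize`, `less_than` and `relations` are hypothetical.

```python
from functools import cmp_to_key


def standardize(universe, less_than, relations):
    """Relabel a finite linearly ordered structure onto {1, ..., n}.

    universe  -- iterable of hashable elements (hypothetical input format)
    less_than -- function (x, y) -> bool giving a strict linear order
    relations -- dict mapping relation names to sets of tuples
    """
    def compare(x, y):
        if less_than(x, y):
            return -1
        if less_than(y, x):
            return 1
        return 0

    ordered = sorted(universe, key=cmp_to_key(compare))
    # The i-th smallest element (counting from 1) is sent to i.
    rank = {x: i for i, x in enumerate(ordered, start=1)}
    renamed = {
        name: {tuple(rank[x] for x in tup) for tup in tuples}
        for name, tuples in relations.items()
    }
    return len(ordered), renamed
```

Two isomorphic ordered structures are sent to literally equal outputs, which is the sense in which equality may replace isomorphism after standardization.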

A.l Game tableaux and the amalgamation theorem 

The complete invariant carries much unnecessary information, which has the tendency to obscure 
what is really essential for analyzing the Strong $L^k$-Canonization Problem. Thus, we reduce the 
complete invariant to a less informative structure called a game-tableau theory. As we have noted 
before, the quotient set $M^k/{\equiv^k}$ is essentially synonymous with the set $S^k(T)$ of k-variable k-types 
of the theory $T = Th^k(M)$. Moreover, the accessibility relation between k-variable k-types of 
$M$ is an invariant of $T$. That is, we may consider $\mathrm{Acc} \subseteq S^k(T) \times S^k(T)$ such that for any $\rho$- 
structure $N$, if $N \models T$, $b \in N^k$, $p = \mathrm{tp}^k(b; N)$ and $(p, q) \in \mathrm{Acc}$, then there is a $b' \in N$ such that 
$\mathrm{tp}^k(b', b_2, \dots, b_k; N) = q$, and moreover, if $b'' \in N$ and $q' = \mathrm{tp}^k(b'', b_2, \dots, b_k; N)$, then $(p, q') \in \mathrm{Acc}$. 
Similarly, if $\sigma \in \mathrm{Sym}[k]$ and $\mathrm{tp}^k(a; M) = \mathrm{tp}^k(b; N)$, then 



In fact, as we shall shortly see, these facts together with some types in the language of equality 
effectively determine the class of models of the theory T. 

Let Mq be a fixed finite ^p-structure, and let T = Th^{AiQ). We enumerate S^{T) by ai (x), a]\[(x) 
ordered according to I^{Mo), where x = {xi, ...,Xk) is a tuple of pairwisc distinct variables. For each 
a G S^{T), let Ra ^ be a /c-ary relation symbol, and let fJ,a{x) be the unique complete (quantifier- free) 
fe-type in the language of equality such that T \= Vx(a(x) — )• iJ,a{x)). Let g'^ = {Ra ■ ol G ^^(T)}. 
We define T'^ to be the theory in the language of ^ consisting of the following assertions (which 
obviously comprise an V3-theory) : 





G2: The "type" R^ of a fe-tuple matches the equality type of the genuine type a: 




a 



We write a"^ for the unique type P such that 



T N \/xi...Xk (a(x) o P{xc,(i),..., 
There is a pair of transformations, computable in relational polynomial-time, 

: fin[r] hii[T% - """^ : hii[T^] fin[r] 

which completely characterize the relationship between T and T'^jf] Firstly, suppose Ai G fin[T]; 
we define 7W*^ to be the ^>'^-structure with universe M and the obvious interpretations, 

R^"" = |a G m'' : t/(a; M) = a} 

for each a E S^{T). The fact that \= is an easy consequence of Theorem lA.ll Secondly, 
suppose 2t G fin[T'^] with universe A. For i?^'"^ G g, we set 

It is essentially trivial to show that the ^-structure 21 "^"'^ is well-defined and, indeed, a model of 
T. Collecting these facts, we have: 

Observation A. 2. The transformations — *^ and — are inverses of each other; that is to say, for 

any M G fin[r] and any 21 G fin[r<^], {M'^) = M and (21 '""'^ = 21. 

A model 21 of is called a ^arrie tableau for T, and the model 21 is sometimes called the 

realization of 21. Moreover, for a given model A4 of T, the structure Ai^ is called the g'ame tableau 
of Ai; thus, a model of T is the unique realization of its own game tableau. The theory T*^ is 
the theory of game tableaux of T. Abusing notation slightly, we will write Ty' for the sub-theory 
consisting of the axioms Gl through G4. 

Since there is nothing interesting to distinguish a model of T from its game tableau and the 
transformation is polynomial-time computable, it is not really necessary to distinguish between 
finite models of T and finite models T*-'; consequently, we will also dispense with the gothic script. 
In the next section, we will see that working with game tableaux makes a model-theoretic analysis 
much more tractable than would be the case in the original signature. The correspondence goes 
just a bit further in the following proposition (whose proof we omit because it is very simple). 

Proposition A. 3. Let A4 and J\f be models of T , and let A C M. For any mapping f : A N, 
the following are equivalent: 

1. f is a partial -elementary embedding M. M . 

2. f is a partial q'-' -isomorphism M'^ J\f^ . 

In particular, if is a model of T, then the complete quantifier-free type qtp{a; M'^) of a tuple 
o in the sense of A^*^ is equivalent, for our purposes, to the complete /c-variable type tp^{a; M). 

Capped theories and amalgamation 

We will say that T is a capped theory if for any finite model A of Ty', there is a finite model Q of 
such that A < G - that is, such that A is an induced substructure of Q. By proposition IA.3| 
any T for which the Strong L'^-Canonization problem is solvable must be capped. 

^Here and after, fin[T] and finfT''] denote the classes of finite models of T and T*^, respectively. 
Lemma A. 4. Suppose A, and A4i are models ofT^. Suppose A is a substructure of both Jv[q 
and M.I, and Mq n Mi = A. Then there is a model C ofT^ and q*^ -emheddings gi : M-i C such 
that go= gi. 

Proof. The idea of the proof is to construct a sort of free-join of $M_0$ and $M_1$ over $A$. It will not 
be a genuine free-join because the axioms may force some additional equalities of elements, and the 
modified equality relation will then be a non-trivial equivalence relation, say $E$, on $M_0 \cup M_1$. It is 
key, then, to maintain the condition $E \cap (M_i \times M_i) = 1_{M_i}$, $i < 2$, in order to avoid obstructing 
the embeddings. It turns out that maintaining this invariant through the construction is actually 
sufficient to obtain the amalgam over $A$. 

Let Z = Mo U Ml, and let Ql = U for each a G S^{T). Furthermore, set = 
Z'^ \ (5° and Eq = Imq U 1mi • Suppose we are then given, 

Xg ^ Xg-i ^ ■ ■ ■ ^ Xq 

such that if c G and a G Sym[k], then (co-(i), ...,Co-(fc)) G Xg, and 

where Eg is an equivalence relation on Z such that Eg n (Mj x Mj) = 1 a/, for i = 0, 1. Let < t < fc, 
and let ci, ct G Mq and Q+i, G Mi such that c G X^. Let G S^{T) such that 

_^^mod 1= ^q(ci, ...,Ct,Ct, ...,Ct) 

and 

A^i"""*^ N 771 (q+1, ...,Cfe,Cfc, ...,Cfc). 

For brevity, we identify ?7o(^)) which asserts Ai=t+i = ^t; with the t-type it asserts on xi, ...,xt, 
and similarly for rji. We then take the following actions: 

1. Let a G S^{T) such that 

T N Vx(a(x) ?7o(a;i, ...,xt) /\r]i{xt+i, ...,Xk)) ■ 

Set 

Q^+i = Q^U(77o(M*)x,7i(Mi'=-*)) 
defining Q^t^ analogously for each a G iS'?/m[A;]. 

2. Let $E_{s+1}$ be the $\subseteq$-minimal equivalence relation on $Z$ containing $E_s$ and each $(c_i, c_j)$, $i \le t < j$, 
such that $T \models \forall x(\alpha(x) \to x_i = x_j)$. 
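
The second action, together with the invariant that the construction must maintain, can be sketched with a union-find structure. This is a minimal illustration, assuming that $Z$ is given as a finite set and that the pairs forced equal at a stage are already known; the names `UnionFind`, `extend` and `is_identity_on` are hypothetical and do not come from the text.

```python
class UnionFind:
    """Equivalence relation on a finite set, stored as a forest."""

    def __init__(self, elements):
        self.parent = {x: x for x in elements}

    def find(self, x):
        # Walk to the root, shortening the path along the way.
        while self.parent[x] != x:
            self.parent[x] = self.parent[self.parent[x]]
            x = self.parent[x]
        return x

    def union(self, x, y):
        self.parent[self.find(x)] = self.find(y)


def extend(uf, new_pairs):
    """Least equivalence relation containing the current one and new_pairs."""
    for x, y in new_pairs:
        uf.union(x, y)
    return uf


def is_identity_on(uf, side):
    """True if no two distinct elements of `side` are identified."""
    roots = [uf.find(x) for x in side]
    return len(set(roots)) == len(roots)
```

Checking `is_identity_on` for both $M_0$ and $M_1$ after each call to `extend` corresponds to the condition $E_{s+1} \cap (M_i \times M_i) = 1_{M_i}$.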

Claim. We can choose $\alpha$ so that $E_{s+1} \cap (M_i \times M_i) = 1_{M_i}$, $i = 0, 1$. 

T^roo/ of claim. We prove the claim for i = 0; the other statement follows by symmetry. Note that 
we may assume s > 0. Suppose a,b G Mq, a ^ b and aEg+ib. We may assume that (a, b) G Eg^i\Eg 
and that aEgC and bEg^ic for some c G Mi. In particular, there are (w.l.o.g.) elements 

01 = a, 02, at' G Mo 
^,+1 = c,C2,...,Ck-t' G Ml 
61 = b,b2,...,bt G Mq 
ct+i = c,C2,...,Ck-t G Ml 
such that at step $s - 1$, we acted on 

Co = tp\padk{ay,Mo'^°'^ ), Ci = tp'' {padk(c'); Mi""""^ ) 

and at step s (as above), we acted on 

r?o = t/(pa4(b); A^o'""'' ), Vi = tp' {padkic); Mr""" )• 
Since $\zeta_0 \wedge \zeta_1 \models x_1 = x_{t'+1}$ and $\eta_0 \wedge \eta_1 \models x_1 = x_{t+1}$, we know that 

tp\a;Mo"'°^ ) = i/(c;Mr°'* ) = tp\b;Mo"'°^ )■ 
As A^o is a model of T*^, there are a'l = a, ai^, ...a[ G Mq such that 

tp'{padk{a'y,Mr°'') = m- 

Again, because Mq is a model of T*^, there are dt+i, dfc G Mq such that tp^{padk{d); M.q^"'^ ) is 
equal to r^i. Now, 

A 771(d) ^ a = dt+i 

r]o{b) A r]i{d) b = dt+i 

so in fact, a = 6, a contradiction. □ 

Since Z is finite, there is a number n < oj such that X„ = 0. (In fact, n < \S^{T)\^.) Let C = Z/En, 
and for a G 5^(r), let 

i?^ = {{ci/En,...,Ck/En) : (ci,...,Cfe) G g^}. 

For i = 0, 1, define gi : Bi ^ C hy gi{b) = bjE^- It remains to verify that the triple {C^g^^gx) 
satisfies the requirements of the lemma. Gl: For each fc-tuple c = (ci,...,Cfc) G Z^ , either c G 
Mq U or c G \ Xg for some unique s < n; hence, c is certainly assigned a unique type. G2 
is immediate from the claim we proved above, and 03 follows directly from the construction. G4 is 
just plain old immediate. Finally, it's relatively easy to see that go and gi are g'^-embeddings that 
agree on A (in fact, each is the identity map on yl). □ 

The lemma, together with the assumption that T is a capped theory, easily yields the following 
very useful fact. 

Theorem A. 5 (Amalgamation theorem: AP/'models in fin[T'^]). Assume that T is capped. Suppose 
A, Aio and Ml are models ofT'^. Suppose A is a substructure of both Mo and Mi, and MoDMi = 
A. Then there are a model N ofT'^ and g'^-embeddings gi : Mi — >■ Af such that go = gi- 

B Directed acyclic graphs and d-separation. 

Definition B.l (Dags and descendants). We recall a few standard definitions around graphs. 

1. A digraph is a pair G = {V,E), where F is a nonempty set and E C V x V is such that 
{v,v) ^ E whenever v eV. 

The underlying (undirected) graph of G is, then, G = (V, E) where E = {{u, v} : (u, v) G E}. 

Where convenient, we will understand digraphs as structures with signature {iZ*-^)}; so, if 
G = {V, E) is a digraph, the associate {iZ}-structure is G = (F, RP), where R'^ = E. 
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2. A path in G is a sequence (vi, ■■■,Vn) from V^^ such that (fj, v^+i) € E for each i = 1, n — 1. 

Similarly, path in G is a sequence {vi,...,Vn) from V^'^ such that € for each 

i = I, n — 1. 

3. The digraph G is acyclic if there is no path {vi,...,Vn) in G such that vi = Vn- (Note that 
this does not imply that G is acyclic.) 

An acyclic digraph G is also called a dag (for "directed acyclic graph"). 

4. In a dag G, we say that v V is a (proper) descendant of u V just in case there is a path 
{vi, Vn) in G such that = n and w„ = f . For a subset X (^V, we define 



It can be shown that G = {V, E) is a dag if and only if the transitive closure of is a partial 
order, say <e, of V] in this scenario, then, descG(X) = {v :3x ^ X.x <e v} 

5. Assuming G is a dag, a trail in G is a sequence (vi, Vn) from which is a path in the G. 
That is, for each i = l,...,n — 1, either {vi,Vi^i) € or (wj+i,Wj) G R'-' . (Not both because 
G is acyclic.) 
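
The notions of acyclicity and descendant can be made concrete as follows. This is a small sketch, assuming a digraph is given as a set of vertices and a set of ordered pairs; the function names are hypothetical, and the `reflexive` flag is an added convenience for the convention in which a vertex counts among its own descendants.

```python
def reachable(vertices, edges):
    """Map each vertex to the set of vertices reachable by a path with at least one edge."""
    children = {v: set() for v in vertices}
    for u, w in edges:
        children[u].add(w)
    reach = {}
    for v in vertices:
        seen, stack = set(), [v]
        while stack:
            for w in children[stack.pop()]:
                if w not in seen:
                    seen.add(w)
                    stack.append(w)
        reach[v] = seen
    return reach


def is_dag(vertices, edges):
    """Acyclic means that no vertex reaches itself along a nonempty path."""
    reach = reachable(vertices, edges)
    return all(v not in reach[v] for v in vertices)


def desc(vertices, edges, X, reflexive=False):
    """Union of the descendant sets of the members of X."""
    reach = reachable(vertices, edges)
    result = set().union(*(reach[x] for x in X))
    return result | set(X) if reflexive else result
```

On the "diamond" with edges 1→2, 1→3, 2→4, 3→4, `is_dag` succeeds even though the underlying undirected graph contains a cycle, which matches the parenthetical remark in item 3.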

Definition B.2 (d-Separation in dags). Let $G = (G, R^G)$ be a dag, and let $\tilde{G} = (G, R^{\tilde{G}})$ be the 
underlying undirected graph of $G$. 

1. Let $t = (v_1, \dots, v_n)$ be a trail in $G$, and let $i \in \{2, \dots, n - 1\}$. 

(a) $t$ is head-to-tail at $v_i$ if either $G \models R(v_{i-1}, v_i) \wedge R(v_i, v_{i+1})$ or $G \models R(v_{i+1}, v_i) \wedge R(v_i, v_{i-1})$. 

(b) $t$ is tail-to-tail at $v_i$ if $G \models R(v_i, v_{i-1}) \wedge R(v_i, v_{i+1})$. 

(c) $t$ is head-to-head at $v_i$ if $G \models R(v_{i-1}, v_i) \wedge R(v_{i+1}, v_i)$. 

2. Let $Z \subseteq G$, and let $t = (v_1, \dots, v_n)$ be a trail in $G$. We say that $t$ is $Z$-blocked if for some 
$i \in \{2, \dots, n - 1\}$, one of the following holds: 

(a) $t$ is head-to-tail or tail-to-tail at $v_i$ and $v_i \in Z$; 

(b) $t$ is head-to-head at $v_i$ and $Z \cap \mathrm{desc}_G(\{v_i\}) = \emptyset$. 

3. Let $X, Y, Z \subseteq G$. We say that $X$ and $Y$ are d-separated by $Z$ (in $G$) if 

(a) $X \cap Y \subseteq Z$; 

(b) For every trail $t = (v_1, \dots, v_n)$ in $G$ such that $v_1 \in X \setminus Z$ and $v_n \in Y \setminus Z$, $t$ is $Z$-blocked. 

We write $[X \perp\!\!\!\perp Y \mid Z]_G$ to indicate that $X$ and $Y$ are d-separated by $Z$ in $G$. 
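
The definition can be checked by brute force on small dags. The sketch below makes two assumptions that are not stated in the text: only trails without repeated vertices are enumerated, and at a head-to-head vertex the vertex itself is counted among its descendants (the usual convention in the statistical literature). All function names are hypothetical.

```python
def descendants(edges, v):
    """v together with every vertex reachable from v along directed edges."""
    children = {}
    for u, w in edges:
        children.setdefault(u, set()).add(w)
    seen, stack = {v}, [v]
    while stack:
        for w in children.get(stack.pop(), ()):
            if w not in seen:
                seen.add(w)
                stack.append(w)
    return seen


def trails(edges, start, end):
    """All vertex-simple trails from start to end in the underlying undirected graph."""
    nbrs = {}
    for u, w in edges:
        nbrs.setdefault(u, set()).add(w)
        nbrs.setdefault(w, set()).add(u)

    def extend(path):
        if path[-1] == end:
            yield path
            return
        for w in nbrs.get(path[-1], ()):
            if w not in path:
                yield from extend(path + [w])

    yield from extend([start])


def is_blocked(edges, trail, Z):
    """Z-blocked: some interior vertex satisfies clause 2(a) or 2(b)."""
    E = set(edges)
    for i in range(1, len(trail) - 1):
        a, v, b = trail[i - 1], trail[i], trail[i + 1]
        if (a, v) in E and (b, v) in E:          # head-to-head at v
            if not (descendants(edges, v) & Z):
                return True
        elif v in Z:                             # head-to-tail or tail-to-tail at v
            return True
    return False


def d_separated(edges, X, Y, Z):
    X, Y, Z = set(X), set(Y), set(Z)
    if not (X & Y) <= Z:
        return False
    return all(
        is_blocked(edges, t, Z)
        for x in X - Z
        for y in Y - Z
        for t in trails(edges, x, y)
    )
```

For the collider a→c←b, the sets {a} and {b} are d-separated by the empty set but not by {c}; for the chain a→b→c the situation is reversed, with {b} separating {a} from {c}.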

The notion of d-separation seems to have arisen, originally, in statistical learning theory, as in 
[16]. In that domain, d-separation in a finite dag $G = (V, E)$ is a means of representing conditional 
independence under a joint probability distribution $p$ on a system of random variables $(X_v)_{v \in V}$. Let 
$\mathrm{parents}_G(v) = \{u \in V : (u, v) \in E\}$; the graph structure is taken as a synonym for the assertion 

$$p\big((X_v = a_v)_{v \in V}\big) = \prod_{v \in V} p\big(X_v = a_v \,\big|\, (X_u = a_u)_{u \in \mathrm{parents}_G(v)}\big).$$
One might, then, be interested in the conditional independence properties of marginals of $p$. Let 
us write $p(A = a)$ as shorthand for the marginal distribution $p((X_v = a_v)_{v \in A})$, where $A \subseteq V$ 
and $a \in \prod_{v \in V} S_v$, so that each $v \in V$ corresponds to an $S_v$-valued random variable. Then, given 
$A, B, C \subseteq V$, one might ask if $p(A = a, B = b \mid C = c) = p(A = a \mid C = c) \cdot p(B = b \mid C = c)$. It can 
be shown, as in [16] and [3], that if $A, B, C$ are pairwise disjoint, then $p(A = a, B = b \mid C = c) = 
p(A = a \mid C = c) \cdot p(B = b \mid C = c)$, for all $a, b, c$, if and only if $[A \perp\!\!\!\perp B \mid C]_G$ holds. 
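
A small numerical illustration may help here. The sketch below builds a joint distribution that factorizes along the collider a→c←b, with conditional probabilities chosen arbitrarily for the example (they are assumptions, not data from the text), and compares marginal and conditional independence of a and b using exact rational arithmetic.

```python
from fractions import Fraction
from itertools import product

HALF = Fraction(1, 2)


def joint(a, b, c):
    """p(a, b, c) = p(a) * p(b) * p(c | a, b) for the collider a -> c <- b."""
    # Hypothetical conditional table: c equals a XOR b with probability 9/10.
    p_c = Fraction(9, 10) if c == (a ^ b) else Fraction(1, 10)
    return HALF * HALF * p_c


def prob(**fixed):
    """Marginal probability of the event fixing some of the variables a, b, c."""
    total = Fraction(0)
    for a, b, c in product((0, 1), repeat=3):
        point = {"a": a, "b": b, "c": c}
        if all(point[name] == value for name, value in fixed.items()):
            total += joint(a, b, c)
    return total


# a and b are d-separated by the empty set, and indeed independent:
marginally_independent = all(
    prob(a=x, b=y) == prob(a=x) * prob(b=y) for x in (0, 1) for y in (0, 1)
)

# Conditioning on the collider c breaks the independence:
lhs = prob(a=0, b=0, c=0) / prob(c=0)
rhs = (prob(a=0, c=0) / prob(c=0)) * (prob(b=0, c=0) / prob(c=0))
conditionally_independent = lhs == rhs
```

With these numbers, `marginally_independent` is true while `conditionally_independent` is false, in agreement with the fact that the trail through c is blocked by the empty set but not by {c}.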

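Theorem B.3. Let $G = (G, R^G)$ be a dag. 

0: $[X \perp\!\!\!\perp Y \mid Z]_G \iff X \cap Y \subseteq Z \wedge [(X \setminus Z) \perp\!\!\!\perp (Y \setminus Z) \mid Z]_G$ 

Symmetry: $[X \perp\!\!\!\perp Y \mid Z]_G \implies [Y \perp\!\!\!\perp X \mid Z]_G$ 

Monotonicity: $[X \perp\!\!\!\perp Y \mid Z]_G \wedge Y_0 \subseteq Y \implies [X \perp\!\!\!\perp Y_0 \mid Z]_G$ 

Base-monotonicity: $[X \perp\!\!\!\perp Y \mid Z]_G \wedge Y_0 \subseteq Y \implies [X \perp\!\!\!\perp (Y \setminus Y_0) \mid Z \cup Y_0]_G$ 

Triviality: $[X \perp\!\!\!\perp Y_1 \mid Z]_G \wedge [X \perp\!\!\!\perp Y_2 \mid Z]_G \implies [X \perp\!\!\!\perp Y_1 \cup Y_2 \mid Z]_G$ 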

The names of the properties are chosen, here, to correspond to those similar properties of model-theoretic 
independence relations, and consequently, they are not the names used in the statistical 
learning theory literature. In the latter, the names for Symmetry, Monotonicity, Base-monotonicity, 
and Triviality are, respectively, Symmetry, Decomposition, Weak Union, and Contraction. 